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1, INTRODUCTION 

The Modality of detection plays an important role in the diagnosis of most types of diseases. 
Artificial Intelligence and Neural networks have played an instrumental role in bridging the gap between 
errors in detection. Machine Learning based feature extractions have been applied to tasks like image 
classification and recognition of objects of same or different classes in medical images [1]. However, 
Machine Learning is not always useful for tasks which are trivial as it leads to higher computational costs and 
memory requirements of running complex simula-tions on simple models [2]. 

Ideally speaking Machine Learning algorithms should be usable by non-experts.But are typically not 
the case. There are problems of overfitting, underfit-ting, Gradient vanish-ing, Gradient exploding and 
countless other problems which lead to a nonsensical interpretation of the final results [3]. Hence it takes a 
lot of expert assumptions to design a Ma-chine Learning Algorithm. 

There are countless different types of algorithms availa-ble with each drawback and advantages. 
To Name, a few the problems of making use of HaarFea-tures is the orientation of the image. If the 
orientation of the object which it has trained for has changed 1.e. rotated or hidden partially behind some 
other object, the algorithm fails to identify the object. Algorithms like SIFT (Scale Invariant Feature 
Transform) corrects this by identifying unique key points from Differ-ence of Gaussian-based operations in 
different octaves and encodes them in vector descriptors which meet a certain sta-bility criterion. 
The Rotational invariance is done by finding the rotational assignment of the magnitude of the gradient. 
The SIFT was developed by David Lowe of UBC[4]. Other modern detection algorithms like the SSD, YO- 
LO exists [5]. These are highly complex in nature how-ever all of them aim to reduce the computational cost 
of operations on Images/Videos. The SSD is also referred to as the Single Shot MultiBox Detector [6]. It uses 
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multiple boxes over the image and performs CNN(Convolutional Neural Network) based classifica-tion 
operation on each box of various sizes. The best-chosen Box is decided after training in which the object 
resides in. The most important feature of this method is that it can utilize features across different layers 1.e. 
“Strength in Numbers’”’. 


Problem: The Viola Jones based adaptive boosting is not suitable for detecting features having non-uniform 
shapes and strucutres. Despite this drawback it 1s still a very fast and efficient algorithm 


Proposed Solution: Here we evaluate the possibility of using multiple custom handcrafted features to train 
the model and to evaluate the performance of the algorithm on images having uneven shape profile here in 
this case Red skin leisions. This could be of significant interest since cosmeticians wouldn’t have to manually 
look for such defects on the skin to assess individual severity of these outgrowths. Instead the algorithm can 
be used to quantify the extent of damage on the outgrowths of the skin. 


2. RESEARCH METHOD 
2.1. Haar Features and Adaptive Boosting 
Figure | Shows the haar masks. 


OOPS 
SOOO 


Figure 1. Shows the haar masks 


Typically, there are 4 types of Haar Features i.e. Edge Features, Line Features, and Four-Square 
Features. These Features represent certain characteristics of an image. For example, if we are to identify a 
nose in the next image, we would make use of the line or edge feature. The central budging of the nose 
represents pixels with brighter value and the sides of the nose have pixels of low intensity. Hence depending 
on the type of application, the chosen haar features are applied. The Viola-Jones finds how close the 
intensities are with the ideal case. The positive values are averaged, and the Negative values are averaged and 
subtracted. To get a certain value which can be compared with how close it is to 0 or 1. 

Figure 2 Shows the application of Haar Feature on an image of identifying a nose (From Bing). 





Figure 2. Shows the application of Haar Feature on an image of identifying a nose (From Bing) 


Traming is done by making use of positive and negative images. The positive images are of the objects 
which we want to detect, and the negative images are of arbitrary nature but should not have the object we are 
training the classifier for in the images. Typically, it is desired that there should be at least a thousand negative and 
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positive images for good training. The algorithm scales down the images to 24X24 and the features are 
scaled up [7]. 

Adaptive Boosting is a technique employed to identify strong features using weak features [8]. The relation can be 
mathematically written as 


F(x)=at(x)+af(x)..........008 +af(Xn) 


Where F(x) is the strong classifier and f(x) are weak classifier a is the weight of that classifier. 
Cascading 1s then performed. The main advantage of cascading 1s that it is very fast[9]. 

A sub window is taken and the features which the classifier is trained for are searched if the most 
important features are missing then the classifier rejects that window. It then moves with a stride of N. If the 
classifier finds some features relevant to the object it is trained for it looks for the second most relevant 
features and if it is found looks for the third until all the features are identified. If any of the feature is 
missing in one or more of the subsequent steps the sub-window is terminated. 

Intuitive Example: If we are identifying a face the algorithm first looks for eyes and if it is present it looks for 
nose and if present looks for lips .etc. This continues until it finds the features defining a face. 


2.2. Creating Handcrafted XML 

A third-party application has been used to custom create the cascade files. The application is 
designed by an algorithm Dasardh [10]. The positive images are put in the raw-data folder and the negative 
images are put in the negative folder. Create_list.batch file is then run. This creates a list of all the negative 
images with their associated file names. In the positive Folder, Object-Marker is launched which allows for 
custom cropping of Regions of Interest on each of the individual training images (positive). The areas of 
interest are cropped and saved in the form of a rectangle of coordinates (x, y, xl, yl) with the associated 
intensity values. This has an advantage that using object marker we can crop multiple ROI from one image. 
Hence results in feature augmentation 1.e. increasing the number of features form a small pool of images. 

Sample_Creations.batch file is then run which creates a vector file following which we use the file 
haar train to train the algorithm. After the cascade files are generated it is converted to XML format by 
making use of convert. Batch file, which creates the XML file with custom features. 

The Next images would show the contents and the generation of custom cascades shown in 
Figures 3-10. 


) cascades 

) negative 

) positive 
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Figure 3. The files in the third party software Figure 4. The raw data is where positive 
images are put 
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Figure 5. Positive images 
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Figure 9. Handcrafting features of interest 
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Figure 6. The list of negative images 


LJ Name Date modified Type Size 


vector.vec 04-01-2019 13:43 VEC File 4kB 


Figure 8. Generated vector file 


B.!G:A\Making your Own Cascades\dasar_haartrain\positive\objectmarker.exe 





Figure 10. Values picked up from the crop 
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3. RESULTS AND DISCUSSION 

After the cascade files are generated, we implement standard AdaBoost implementation in Open 
CV. Using the classic function detect MultiScale with set parameters of Scale 1.0001 and Neighborhood of 1. 
The Parameters were chosen which resulted in the largest number of detected bounding boxes. Figure 1 1(a) 
and Figure 11(b) shows test image and the hits and misses marked in red and blue (these have been 
thoroughly analyzed by repeated inspection to derive a rough estimate of the accuracy of the system. 








Figure 11(b). The hits and misses marked in red and blue (these have been thoroughly analyzed by repeated 
inspection to derive a rough estimate of the accuracy of the system 


A perfomance study of the suitability of adaptive boosting in red acne detection (Satyake Bakshi) 


1498 O ISSN: 2502-4752 


The model yielded very good accuracy and performance with respect to sensitivity. The number of 
True Positive Detection outnumbers the False Negative by a lot of margins hence it can be concluded the 
algorithm performs sufficiently well for detecting Red Acne or for any superficial lesion detection. 
Figure 12 Shows the sensitivity of the model. TP v/s FP Curve is shown in Figure 13. 


Sensitivity TP &FP 
0,85 80 
0,8 60 
0,7 7 ee 
0 
0,65 1 2 3 4 
0,6 
Figure 12. Sensitivity of the model Figure 13. TP v/s FP curve 


There is still an issue of multiple overlapping bounding boxes generated from the algorithm. 
This can be corrected by making use of NMS algorithm. Hence it has been shown that Cascading works 
sufficiently well for applications not only pertaining to facial recognition but also for dermatological 
applications. 


4. CONCLUSION 

Hence it is seen that Viola Jones based adaptive boosting method performs quite well with 
handcrafted features pertaining to irregular objects of interest here in this case red acnes. Henceforth it can be 
concluded that Viola Jones even though having a significant False positive rate the True Positives are quite 
accurate in most cases within the scope of this study. Hence it can be fairly assumed that this algorithm is 
quite efficient in highlighting important features form custom training as opposed to earlier claims of it not 
being able to detect uneven features. 
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